Corpus: cat_news_2020_300K

Other corpora

5.1.18 Words nearly always as next neighbors

Strong NN co-occurrences with a low probability of being separated

The quotient below is calculated as freq(word1)*freq(word1)/NN_freq^2.

Word 1 Word 1 Frequency of word 1 Frequency of word 2 Frequency as NN Qoutient
Estats Units 1218 1240 1188 1.07
Regne Unit 654 641 638 1.03
Felip VI 421 362 343 1.30
Cures Intensives 171 159 159 1.08
Hong Kong 141 138 138 1.02
Medi Ambient 126 127 118 1.15
Sagrada Família 93 115 89 1.35
Nostra Senyora 120 110 110 1.09
der Leyen 113 101 99 1.16
Joan Serra Carné 58 56 54 1.11
Hard Rock 34 50 34 1.47
Buenos Aires 36 38 36 1.06
Emirats Àrabs 50 35 35 1.43
L’anàlisi d’Antoni 28 33 25 1.48
EN DIRECTE 32 30 29 1.14
ÚLTIMA HORA 32 29 28 1.18
Johns Hopkins 23 28 22 1.33
Lives Matter 29 28 28 1.04
Pròxim Orient 22 28 22 1.27
Stay Homas 26 27 25 1.12
1036 msec needed at 2024-08-22 02:11